Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 8552 |
| Missing cells | 1781 |
| Missing cells (%) | 1.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.7 MiB |
| Average record size in memory | 202.9 B |
Variable types
| Categorical | 7 |
|---|---|
| Text | 5 |
| Numeric | 8 |
| DateTime | 1 |
VoteAverage is highly overall correlated with weighted_average | High correlation |
VoteCount is highly overall correlated with Budget and 1 other fields | High correlation |
Budget is highly overall correlated with VoteCount and 1 other fields | High correlation |
Revenue is highly overall correlated with VoteCount and 1 other fields | High correlation |
weighted_average is highly overall correlated with VoteAverage | High correlation |
OriginalLanguage is highly overall correlated with North America and 1 other fields | High correlation |
North America is highly overall correlated with OriginalLanguage | High correlation |
Asia is highly overall correlated with OriginalLanguage | High correlation |
Oceania is highly imbalanced (83.9%) | Imbalance |
South America is highly imbalanced (90.5%) | Imbalance |
Africa is highly imbalanced (93.4%) | Imbalance |
TagLine has 1781 (20.8%) missing values | Missing |
Budget has 3604 (42.1%) zeros | Zeros |
Revenue has 3247 (38.0%) zeros | Zeros |
Reproduction
| Analysis started | 2023-11-01 22:58:34.660120 |
|---|---|
| Analysis finished | 2023-11-01 22:58:52.873773 |
| Duration | 18.21 seconds |
| Software version | ydata-profiling vv4.6.1 |
| Download configuration | config.json |
OriginalLanguage
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 391.7 KiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8552 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 6535 | |
| 0 | 2017 | 23.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 6535 | |
| 0 | 2017 | 23.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 6535 | |
| 0 | 2017 | 23.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8552 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 6535 | |
| 0 | 2017 | 23.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8552 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 6535 | |
| 0 | 2017 | 23.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8552 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 6535 | |
| 0 | 2017 | 23.6% |
OriginalTitle
Text
| Distinct | 8319 |
|---|---|
| Distinct (%) | 97.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 391.7 KiB |
Length
| Max length | 104 |
|---|---|
| Median length | 59 |
| Mean length | 15.435454 |
| Min length | 1 |
Characters and Unicode
| Total characters | 132004 |
|---|---|
| Distinct characters | 1846 |
| Distinct categories | 20 ? |
| Distinct scripts | 14 ? |
| Distinct blocks | 20 ? |
Unique
| Unique | 8104 ? |
|---|---|
| Unique (%) | 94.8% |
Sample
| 1st row | Inception |
|---|---|
| 2nd row | Black Widow |
| 3rd row | The Matrix |
| 4th row | 정이 |
| 5th row | Trolls World Tour |
| Value | Count | Frequency (%) |
| the | 2153 | 9.2% |
| of | 603 | 2.6% |
| a | 288 | 1.2% |
| 2 | 231 | 1.0% |
| in | 194 | 0.8% |
| and | 190 | 0.8% |
| 178 | 0.8% | |
| to | 150 | 0.6% |
| la | 113 | 0.5% |
| de | 85 | 0.4% |
| Other values (8448) | 19192 |
Most occurring characters
| Value | Count | Frequency (%) |
| 14806 | 11.2% | |
| e | 12495 | 9.5% |
| a | 7811 | 5.9% |
| o | 7042 | 5.3% |
| n | 6587 | 5.0% |
| r | 6505 | 4.9% |
| i | 6385 | 4.8% |
| t | 5907 | 4.5% |
| s | 4885 | 3.7% |
| h | 4166 | 3.2% |
| Other values (1836) | 55415 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 86418 | |
| Uppercase Letter | 19020 | 14.4% |
| Space Separator | 14825 | 11.2% |
| Other Letter | 7914 | 6.0% |
| Other Punctuation | 2064 | 1.6% |
| Decimal Number | 1041 | 0.8% |
| Dash Punctuation | 241 | 0.2% |
| Modifier Letter | 225 | 0.2% |
| Nonspacing Mark | 86 | 0.1% |
| Math Symbol | 39 | < 0.1% |
| Other values (10) | 131 | 0.1% |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| の | 229 | 2.9% |
| ン | 206 | 2.6% |
| ラ | 101 | 1.3% |
| ス | 100 | 1.3% |
| ト | 86 | 1.1% |
| 場 | 83 | 1.0% |
| 劇 | 82 | 1.0% |
| ド | 81 | 1.0% |
| 版 | 81 | 1.0% |
| ル | 80 | 1.0% |
| Other values (1533) | 6785 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 12495 | |
| a | 7811 | 9.0% |
| o | 7042 | 8.1% |
| n | 6587 | 7.6% |
| r | 6505 | 7.5% |
| i | 6385 | 7.4% |
| t | 5907 | 6.8% |
| s | 4885 | 5.7% |
| h | 4166 | 4.8% |
| l | 4156 | 4.8% |
| Other values (111) | 20479 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 2435 | 12.8% |
| S | 1571 | 8.3% |
| M | 1221 | 6.4% |
| B | 1193 | 6.3% |
| A | 1123 | 5.9% |
| D | 1067 | 5.6% |
| C | 1051 | 5.5% |
| L | 996 | 5.2% |
| P | 890 | 4.7% |
| H | 837 | 4.4% |
| Other values (60) | 6636 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 863 | |
| ' | 381 | |
| . | 236 | 11.4% |
| ! | 142 | 6.9% |
| , | 129 | 6.2% |
| & | 115 | 5.6% |
| ・ | 46 | 2.2% |
| / | 32 | 1.6% |
| ? | 28 | 1.4% |
| : | 21 | 1.0% |
| Other values (14) | 71 | 3.4% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ั | 14 | |
| ์ | 10 | |
| ้ | 9 | |
| ิ | 7 | 8.1% |
| ุ | 5 | 5.8% |
| ゙ | 5 | 5.8% |
| ่ | 5 | 5.8% |
| ं | 4 | 4.7% |
| ู | 4 | 4.7% |
| े | 4 | 4.7% |
| Other values (12) | 19 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 330 | |
| 3 | 174 | |
| 1 | 150 | |
| 0 | 115 | 11.0% |
| 4 | 69 | 6.6% |
| 9 | 48 | 4.6% |
| 5 | 45 | 4.3% |
| 7 | 39 | 3.7% |
| 6 | 34 | 3.3% |
| 8 | 34 | 3.3% |
| Other values (2) | 3 | 0.3% |
Spacing Mark
| Value | Count | Frequency (%) |
| ा | 13 | |
| ी | 2 | 9.5% |
| ि | 2 | 9.5% |
| ం | 1 | 4.8% |
| ो | 1 | 4.8% |
| ை | 1 | 4.8% |
| ி | 1 | 4.8% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 10 | |
| ) | 8 | |
| 」 | 7 | |
| ) | 6 | |
| 〉 | 3 | 8.3% |
| 』 | 1 | 2.8% |
| 】 | 1 | 2.8% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 10 | |
| ( | 8 | |
| 「 | 7 | |
| ( | 6 | |
| 〈 | 3 | 8.3% |
| 『 | 1 | 2.8% |
| 【 | 1 | 2.8% |
Math Symbol
| Value | Count | Frequency (%) |
| ~ | 26 | |
| × | 5 | 12.8% |
| + | 4 | 10.3% |
| ~ | 2 | 5.1% |
| + | 1 | 2.6% |
| ∞ | 1 | 2.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 232 | |
| 〜 | 6 | 2.5% |
| ― | 2 | 0.8% |
| – | 1 | 0.4% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 5 | |
| ³ | 2 | 22.2% |
| ⅓ | 1 | 11.1% |
| ² | 1 | 11.1% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 6 | |
| ” | 1 | 12.5% |
| » | 1 | 12.5% |
Other Symbol
| Value | Count | Frequency (%) |
| ☆ | 4 | |
| △ | 1 | 16.7% |
| ° | 1 | 16.7% |
Letter Number
| Value | Count | Frequency (%) |
| Ⅱ | 3 | |
| Ⅰ | 2 | |
| Ⅲ | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 14806 | ||
| 19 | 0.1% |
Modifier Letter
| Value | Count | Frequency (%) |
| ー | 224 | |
| ʻ | 1 | 0.4% |
Format
| Value | Count | Frequency (%) |
| | 2 | |
| | 1 |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 2 | |
| $ | 1 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 1 | |
| « | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 104627 | |
| Common | 18536 | 14.0% |
| Han | 3101 | 2.3% |
| Katakana | 2147 | 1.6% |
| Hangul | 1226 | 0.9% |
| Hiragana | 1039 | 0.8% |
| Cyrillic | 795 | 0.6% |
| Thai | 323 | 0.2% |
| Devanagari | 85 | 0.1% |
| Arabic | 79 | 0.1% |
| Other values (4) | 46 | < 0.1% |
Most frequent character per script
Han
| Value | Count | Frequency (%) |
| 場 | 83 | 2.7% |
| 劇 | 82 | 2.6% |
| 版 | 81 | 2.6% |
| 女 | 44 | 1.4% |
| 之 | 37 | 1.2% |
| 大 | 31 | 1.0% |
| 人 | 28 | 0.9% |
| 神 | 24 | 0.8% |
| 天 | 23 | 0.7% |
| 少 | 22 | 0.7% |
| Other values (931) | 2646 |
Hangul
| Value | Count | Frequency (%) |
| 의 | 43 | 3.5% |
| 이 | 23 | 1.9% |
| 한 | 20 | 1.6% |
| 마 | 20 | 1.6% |
| 기 | 18 | 1.5% |
| 사 | 17 | 1.4% |
| 시 | 16 | 1.3% |
| 아 | 16 | 1.3% |
| 인 | 15 | 1.2% |
| 엄 | 14 | 1.1% |
| Other values (356) | 1024 |
Latin
| Value | Count | Frequency (%) |
| e | 12495 | 11.9% |
| a | 7811 | 7.5% |
| o | 7042 | 6.7% |
| n | 6587 | 6.3% |
| r | 6505 | 6.2% |
| i | 6385 | 6.1% |
| t | 5907 | 5.6% |
| s | 4885 | 4.7% |
| h | 4166 | 4.0% |
| l | 4156 | 4.0% |
| Other values (111) | 38688 |
Katakana
| Value | Count | Frequency (%) |
| ン | 206 | 9.6% |
| ラ | 101 | 4.7% |
| ス | 100 | 4.7% |
| ト | 86 | 4.0% |
| ド | 81 | 3.8% |
| ル | 80 | 3.7% |
| イ | 71 | 3.3% |
| ア | 68 | 3.2% |
| リ | 56 | 2.6% |
| ッ | 55 | 2.6% |
| Other values (69) | 1243 |
Common
| Value | Count | Frequency (%) |
| 14806 | ||
| : | 863 | 4.7% |
| ' | 381 | 2.1% |
| 2 | 330 | 1.8% |
| . | 236 | 1.3% |
| - | 232 | 1.3% |
| ー | 224 | 1.2% |
| 3 | 174 | 0.9% |
| 1 | 150 | 0.8% |
| ! | 142 | 0.8% |
| Other values (68) | 998 | 5.4% |
Hiragana
| Value | Count | Frequency (%) |
| の | 229 | |
| と | 49 | 4.7% |
| ん | 42 | 4.0% |
| る | 36 | 3.5% |
| た | 35 | 3.4% |
| い | 34 | 3.3% |
| か | 34 | 3.3% |
| を | 33 | 3.2% |
| し | 31 | 3.0% |
| ら | 28 | 2.7% |
| Other values (56) | 488 |
Cyrillic
| Value | Count | Frequency (%) |
| а | 74 | 9.3% |
| о | 73 | 9.2% |
| е | 61 | 7.7% |
| р | 55 | 6.9% |
| и | 51 | 6.4% |
| н | 49 | 6.2% |
| т | 36 | 4.5% |
| к | 32 | 4.0% |
| л | 30 | 3.8% |
| в | 26 | 3.3% |
| Other values (44) | 308 |
Thai
| Value | Count | Frequency (%) |
| า | 25 | 7.7% |
| ก | 23 | 7.1% |
| ร | 21 | 6.5% |
| อ | 17 | 5.3% |
| เ | 15 | 4.6% |
| ั | 14 | 4.3% |
| ม | 14 | 4.3% |
| ต | 13 | 4.0% |
| น | 13 | 4.0% |
| ง | 11 | 3.4% |
| Other values (38) | 157 |
Devanagari
| Value | Count | Frequency (%) |
| ा | 13 | 15.3% |
| न | 5 | 5.9% |
| क | 5 | 5.9% |
| ल | 5 | 5.9% |
| ध | 4 | 4.7% |
| म | 4 | 4.7% |
| ं | 4 | 4.7% |
| र | 4 | 4.7% |
| ग | 4 | 4.7% |
| े | 4 | 4.7% |
| Other values (21) | 33 |
Arabic
| Value | Count | Frequency (%) |
| ا | 12 | |
| ر | 7 | 8.9% |
| م | 6 | 7.6% |
| ل | 6 | 7.6% |
| س | 5 | 6.3% |
| ب | 4 | 5.1% |
| ف | 4 | 5.1% |
| ن | 3 | 3.8% |
| ه | 3 | 3.8% |
| و | 3 | 3.8% |
| Other values (16) | 26 |
Greek
| Value | Count | Frequency (%) |
| ς | 3 | 13.0% |
| ν | 2 | 8.7% |
| ο | 2 | 8.7% |
| Κ | 1 | 4.3% |
| υ | 1 | 4.3% |
| ό | 1 | 4.3% |
| δ | 1 | 4.3% |
| τ | 1 | 4.3% |
| α | 1 | 4.3% |
| η | 1 | 4.3% |
| Other values (9) | 9 |
Telugu
| Value | Count | Frequency (%) |
| డ | 2 | |
| ీ | 1 | |
| జ | 1 | |
| ా | 1 | |
| ం | 1 | |
| బ | 1 | |
| ర | 1 | |
| ె | 1 | |
| ్ | 1 | |
| ి | 1 |
Tamil
| Value | Count | Frequency (%) |
| க | 1 | |
| ை | 1 | |
| த | 1 | |
| ி | 1 |
Inherited
| Value | Count | Frequency (%) |
| ゙ | 5 | |
| ̀ | 2 | 25.0% |
| | 1 | 12.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 122335 | |
| CJK | 3101 | 2.3% |
| Katakana | 2417 | 1.8% |
| Hangul | 1226 | 0.9% |
| Hiragana | 1044 | 0.8% |
| Cyrillic | 795 | 0.6% |
| None | 551 | 0.4% |
| Thai | 323 | 0.2% |
| Devanagari | 85 | 0.1% |
| Arabic | 79 | 0.1% |
| Other values (10) | 48 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 14806 | 12.1% | |
| e | 12495 | 10.2% |
| a | 7811 | 6.4% |
| o | 7042 | 5.8% |
| n | 6587 | 5.4% |
| r | 6505 | 5.3% |
| i | 6385 | 5.2% |
| t | 5907 | 4.8% |
| s | 4885 | 4.0% |
| h | 4166 | 3.4% |
| Other values (74) | 45746 |
Hiragana
| Value | Count | Frequency (%) |
| の | 229 | |
| と | 49 | 4.7% |
| ん | 42 | 4.0% |
| る | 36 | 3.4% |
| た | 35 | 3.4% |
| い | 34 | 3.3% |
| か | 34 | 3.3% |
| を | 33 | 3.2% |
| し | 31 | 3.0% |
| ら | 28 | 2.7% |
| Other values (57) | 493 |
Katakana
| Value | Count | Frequency (%) |
| ー | 224 | 9.3% |
| ン | 206 | 8.5% |
| ラ | 101 | 4.2% |
| ス | 100 | 4.1% |
| ト | 86 | 3.6% |
| ド | 81 | 3.4% |
| ル | 80 | 3.3% |
| イ | 71 | 2.9% |
| ア | 68 | 2.8% |
| リ | 56 | 2.3% |
| Other values (71) | 1344 |
None
| Value | Count | Frequency (%) |
| é | 92 | 16.7% |
| è | 30 | 5.4% |
| ~ | 26 | 4.7% |
| ó | 25 | 4.5% |
| : | 21 | 3.8% |
| í | 20 | 3.6% |
| 19 | 3.4% | |
| á | 17 | 3.1% |
| à | 16 | 2.9% |
| ! | 14 | 2.5% |
| Other values (107) | 271 |
CJK
| Value | Count | Frequency (%) |
| 場 | 83 | 2.7% |
| 劇 | 82 | 2.6% |
| 版 | 81 | 2.6% |
| 女 | 44 | 1.4% |
| 之 | 37 | 1.2% |
| 大 | 31 | 1.0% |
| 人 | 28 | 0.9% |
| 神 | 24 | 0.8% |
| 天 | 23 | 0.7% |
| 少 | 22 | 0.7% |
| Other values (931) | 2646 |
Cyrillic
| Value | Count | Frequency (%) |
| а | 74 | 9.3% |
| о | 73 | 9.2% |
| е | 61 | 7.7% |
| р | 55 | 6.9% |
| и | 51 | 6.4% |
| н | 49 | 6.2% |
| т | 36 | 4.5% |
| к | 32 | 4.0% |
| л | 30 | 3.8% |
| в | 26 | 3.3% |
| Other values (44) | 308 |
Hangul
| Value | Count | Frequency (%) |
| 의 | 43 | 3.5% |
| 이 | 23 | 1.9% |
| 한 | 20 | 1.6% |
| 마 | 20 | 1.6% |
| 기 | 18 | 1.5% |
| 사 | 17 | 1.4% |
| 시 | 16 | 1.3% |
| 아 | 16 | 1.3% |
| 인 | 15 | 1.2% |
| 엄 | 14 | 1.1% |
| Other values (356) | 1024 |
Thai
| Value | Count | Frequency (%) |
| า | 25 | 7.7% |
| ก | 23 | 7.1% |
| ร | 21 | 6.5% |
| อ | 17 | 5.3% |
| เ | 15 | 4.6% |
| ั | 14 | 4.3% |
| ม | 14 | 4.3% |
| ต | 13 | 4.0% |
| น | 13 | 4.0% |
| ง | 11 | 3.4% |
| Other values (38) | 157 |
Devanagari
| Value | Count | Frequency (%) |
| ा | 13 | 15.3% |
| न | 5 | 5.9% |
| क | 5 | 5.9% |
| ल | 5 | 5.9% |
| ध | 4 | 4.7% |
| म | 4 | 4.7% |
| ं | 4 | 4.7% |
| र | 4 | 4.7% |
| ग | 4 | 4.7% |
| े | 4 | 4.7% |
| Other values (21) | 33 |
Arabic
| Value | Count | Frequency (%) |
| ا | 12 | |
| ر | 7 | 8.9% |
| م | 6 | 7.6% |
| ل | 6 | 7.6% |
| س | 5 | 6.3% |
| ب | 4 | 5.1% |
| ف | 4 | 5.1% |
| ن | 3 | 3.8% |
| ه | 3 | 3.8% |
| و | 3 | 3.8% |
| Other values (16) | 26 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 6 | |
| | 2 | 14.3% |
| ― | 2 | 14.3% |
| ” | 1 | 7.1% |
| “ | 1 | 7.1% |
| | 1 | 7.1% |
| – | 1 | 7.1% |
Misc Symbols
| Value | Count | Frequency (%) |
| ☆ | 4 |
Number Forms
| Value | Count | Frequency (%) |
| Ⅱ | 3 | |
| Ⅰ | 2 | |
| Ⅲ | 2 | |
| ⅓ | 1 | 12.5% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ọ | 2 |
Diacriticals
| Value | Count | Frequency (%) |
| ̀ | 2 |
Telugu
| Value | Count | Frequency (%) |
| డ | 2 | |
| ీ | 1 | |
| జ | 1 | |
| ా | 1 | |
| ం | 1 | |
| బ | 1 | |
| ర | 1 | |
| ె | 1 | |
| ్ | 1 | |
| ి | 1 |
Geometric Shapes
| Value | Count | Frequency (%) |
| △ | 1 |
Math Operators
| Value | Count | Frequency (%) |
| ∞ | 1 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 1 |
Tamil
| Value | Count | Frequency (%) |
| க | 1 | |
| ை | 1 | |
| த | 1 | |
| ி | 1 |
Overview
Text
| Distinct | 8539 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 391.7 KiB |
Length
| Max length | 579 |
|---|---|
| Median length | 371 |
| Mean length | 149.43616 |
| Min length | 0 |
Characters and Unicode
| Total characters | 1277978 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 12 ? |
| Distinct scripts | 3 ? |
| Distinct blocks | 5 ? |
Unique
| Unique | 8536 ? |
|---|---|
| Unique (%) | 99.8% |
Sample
| 1st row | skilled thief corporate espionage subconscious target chance regain old life payment task considered impossible inception implantation another person idea target subconscious |
|---|---|
| 2nd row | also known black widow part ledger dangerous conspiracy tie past force stop nothing bring must deal history spy broken relationship left wake long avenger |
| 3rd row | century matrix tell story computer hacker join group underground insurgent fighting vast powerful computer rule earth |
| 4th row | uninhabitable earth outcome civil war hinge brain elite soldier create robot mercenary |
| 5th row | queen poppy branch make surprising discovery — troll world beyond distinct difference create big clash various tribe mysterious threat put troll across land danger poppy branch band friend must embark epic quest create harmony among troll unite certain doom |
| Value | Count | Frequency (%) |
| life | 1648 | 0.9% |
| find | 1355 | 0.7% |
| one | 1141 | 0.6% |
| new | 1135 | 0.6% |
| young | 1124 | 0.6% |
| world | 1079 | 0.6% |
| friend | 958 | 0.5% |
| family | 926 | 0.5% |
| must | 896 | 0.5% |
| two | 835 | 0.4% |
| Other values (12502) | 178468 |
Most occurring characters
| Value | Count | Frequency (%) |
| 181014 | ||
| e | 139143 | 10.9% |
| r | 85547 | 6.7% |
| a | 83457 | 6.5% |
| t | 81496 | 6.4% |
| i | 79836 | 6.2% |
| n | 78850 | 6.2% |
| o | 73680 | 5.8% |
| l | 58945 | 4.6% |
| s | 58884 | 4.6% |
| Other values (36) | 357126 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1095343 | |
| Space Separator | 181014 | 14.2% |
| Dash Punctuation | 746 | 0.1% |
| Final Punctuation | 652 | 0.1% |
| Initial Punctuation | 118 | < 0.1% |
| Other Punctuation | 85 | < 0.1% |
| Other Symbol | 11 | < 0.1% |
| Nonspacing Mark | 3 | < 0.1% |
| Modifier Symbol | 2 | < 0.1% |
| Format | 2 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 139143 | |
| r | 85547 | 7.8% |
| a | 83457 | 7.6% |
| t | 81496 | 7.4% |
| i | 79836 | 7.3% |
| n | 78850 | 7.2% |
| o | 73680 | 6.7% |
| l | 58945 | 5.4% |
| s | 58884 | 5.4% |
| c | 42156 | 3.8% |
| Other values (16) | 313349 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 563 | |
| ” | 88 | 13.5% |
| » | 1 | 0.2% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 86 | |
| ‘ | 31 | 26.3% |
| « | 1 | 0.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| — | 507 | |
| – | 239 |
Other Punctuation
| Value | Count | Frequency (%) |
| … | 82 | |
| • | 3 | 3.5% |
Other Symbol
| Value | Count | Frequency (%) |
| ™ | 9 | |
| ® | 2 | 18.2% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ́ | 2 | |
| ̈ | 1 |
Format
| Value | Count | Frequency (%) |
| | 1 | |
| | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 181014 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 2 |
Currency Symbol
| Value | Count | Frequency (%) |
| £ | 1 |
Other Number
| Value | Count | Frequency (%) |
| ¹ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1095343 | |
| Common | 182632 | 14.3% |
| Inherited | 3 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 139143 | |
| r | 85547 | 7.8% |
| a | 83457 | 7.6% |
| t | 81496 | 7.4% |
| i | 79836 | 7.3% |
| n | 78850 | 7.2% |
| o | 73680 | 6.7% |
| l | 58945 | 5.4% |
| s | 58884 | 5.4% |
| c | 42156 | 3.8% |
| Other values (16) | 313349 |
Common
| Value | Count | Frequency (%) |
| 181014 | ||
| ’ | 563 | 0.3% |
| — | 507 | 0.3% |
| – | 239 | 0.1% |
| ” | 88 | < 0.1% |
| “ | 86 | < 0.1% |
| … | 82 | < 0.1% |
| ‘ | 31 | < 0.1% |
| ™ | 9 | < 0.1% |
| • | 3 | < 0.1% |
| Other values (8) | 10 | < 0.1% |
Inherited
| Value | Count | Frequency (%) |
| ́ | 2 | |
| ̈ | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1276357 | |
| Punctuation | 1601 | 0.1% |
| Letterlike Symbols | 9 | < 0.1% |
| None | 8 | < 0.1% |
| Diacriticals | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 181014 | ||
| e | 139143 | 10.9% |
| r | 85547 | 6.7% |
| a | 83457 | 6.5% |
| t | 81496 | 6.4% |
| i | 79836 | 6.3% |
| n | 78850 | 6.2% |
| o | 73680 | 5.8% |
| l | 58945 | 4.6% |
| s | 58884 | 4.6% |
| Other values (17) | 355505 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 563 | |
| — | 507 | |
| – | 239 | |
| ” | 88 | 5.5% |
| “ | 86 | 5.4% |
| … | 82 | 5.1% |
| ‘ | 31 | 1.9% |
| • | 3 | 0.2% |
| | 1 | 0.1% |
| | 1 | 0.1% |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 9 |
Diacriticals
| Value | Count | Frequency (%) |
| ́ | 2 | |
| ̈ | 1 |
None
| Value | Count | Frequency (%) |
| ´ | 2 | |
| ® | 2 | |
| « | 1 | |
| £ | 1 | |
| ¹ | 1 | |
| » | 1 |
Popularity
Real number (ℝ)
| Distinct | 6966 |
|---|---|
| Distinct (%) | 81.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.0878946 |
| Minimum | 2.5687115 |
|---|---|
| Maximum | 4.4012163 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 391.7 KiB |
Quantile statistics
| Minimum | 2.5687115 |
|---|---|
| 5-th percentile | 2.6002831 |
| Q1 | 2.7389673 |
| median | 2.9672042 |
| Q3 | 3.3278109 |
| 95-th percentile | 4.0105634 |
| Maximum | 4.4012163 |
| Range | 1.8325048 |
| Interquartile range (IQR) | 0.58884361 |
Descriptive statistics
| Standard deviation | 0.43646712 |
|---|---|
| Coefficient of variation (CV) | 0.1413478 |
| Kurtosis | 0.23845823 |
| Mean | 3.0878946 |
| Median Absolute Deviation (MAD) | 0.26624574 |
| Skewness | 1.0027189 |
| Sum | 26407.675 |
| Variance | 0.19050355 |
| Monotonicity | Decreasing |
| Value | Count | Frequency (%) |
| 2.587538446 | 6 | 0.1% |
| 2.740775506 | 6 | 0.1% |
| 2.781672341 | 6 | 0.1% |
| 2.690497042 | 5 | 0.1% |
| 2.66924007 | 4 | < 0.1% |
| 2.892757798 | 4 | < 0.1% |
| 2.64808786 | 4 | < 0.1% |
| 2.615203651 | 4 | < 0.1% |
| 2.647379745 | 4 | < 0.1% |
| 2.719715233 | 4 | < 0.1% |
| Other values (6956) | 8505 |
| Value | Count | Frequency (%) |
| 2.568711502 | 3 | |
| 2.568788134 | 1 | < 0.1% |
| 2.568864759 | 3 | |
| 2.568941379 | 2 | |
| 2.5690946 | 2 | |
| 2.569171202 | 2 | |
| 2.569247798 | 1 | < 0.1% |
| 2.569324388 | 2 | |
| 2.569477551 | 1 | < 0.1% |
| 2.569783807 | 2 |
| Value | Count | Frequency (%) |
| 4.401216329 | 1 | |
| 4.399596379 | 1 | |
| 4.393979909 | 1 | |
| 4.393819327 | 1 | |
| 4.393436296 | 1 | |
| 4.390862483 | 1 | |
| 4.385906598 | 1 | |
| 4.384972292 | 1 | |
| 4.384934902 | 1 | |
| 4.383575435 | 1 |
ReleaseDate
Date
| Distinct | 5445 |
|---|---|
| Distinct (%) | 63.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 391.7 KiB |
| Minimum | 1920-02-27 00:00:00 |
|---|---|
| Maximum | 2023-10-26 00:00:00 |
Title
Text
| Distinct | 8264 |
|---|---|
| Distinct (%) | 96.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 391.7 KiB |
Length
| Max length | 104 |
|---|---|
| Median length | 70 |
| Mean length | 16.821562 |
| Min length | 1 |
Characters and Unicode
| Total characters | 143858 |
|---|---|
| Distinct characters | 127 |
| Distinct categories | 17 ? |
| Distinct scripts | 5 ? |
| Distinct blocks | 9 ? |
Unique
| Unique | 8002 ? |
|---|---|
| Unique (%) | 93.6% |
Sample
| 1st row | Inception |
|---|---|
| 2nd row | Black Widow |
| 3rd row | The Matrix |
| 4th row | JUNG_E |
| 5th row | Trolls World Tour |
| Value | Count | Frequency (%) |
| the | 2893 | 11.3% |
| of | 869 | 3.4% |
| a | 358 | 1.4% |
| and | 285 | 1.1% |
| in | 259 | 1.0% |
| 2 | 252 | 1.0% |
| to | 190 | 0.7% |
| 174 | 0.7% | |
| movie | 147 | 0.6% |
| love | 109 | 0.4% |
| Other values (6917) | 20090 |
Most occurring characters
| Value | Count | Frequency (%) |
| 17073 | 11.9% | |
| e | 14842 | 10.3% |
| a | 8822 | 6.1% |
| o | 8648 | 6.0% |
| n | 7771 | 5.4% |
| r | 7587 | 5.3% |
| i | 7505 | 5.2% |
| t | 7162 | 5.0% |
| s | 5581 | 3.9% |
| h | 5379 | 3.7% |
| Other values (117) | 53488 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 100686 | |
| Uppercase Letter | 22376 | 15.6% |
| Space Separator | 17075 | 11.9% |
| Other Punctuation | 2334 | 1.6% |
| Decimal Number | 1061 | 0.7% |
| Dash Punctuation | 258 | 0.2% |
| Close Punctuation | 18 | < 0.1% |
| Open Punctuation | 18 | < 0.1% |
| Other Number | 10 | < 0.1% |
| Other Letter | 8 | < 0.1% |
| Other values (7) | 14 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 14842 | |
| a | 8822 | 8.8% |
| o | 8648 | 8.6% |
| n | 7771 | 7.7% |
| r | 7587 | 7.5% |
| i | 7505 | 7.5% |
| t | 7162 | 7.1% |
| s | 5581 | 5.5% |
| h | 5379 | 5.3% |
| l | 4721 | 4.7% |
| Other values (30) | 22668 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 2977 | 13.3% |
| S | 1958 | 8.8% |
| M | 1506 | 6.7% |
| B | 1442 | 6.4% |
| A | 1279 | 5.7% |
| C | 1271 | 5.7% |
| D | 1258 | 5.6% |
| P | 1063 | 4.8% |
| L | 1055 | 4.7% |
| H | 949 | 4.2% |
| Other values (20) | 7618 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1182 | |
| ' | 416 | 17.8% |
| . | 241 | 10.3% |
| , | 151 | 6.5% |
| ! | 138 | 5.9% |
| & | 124 | 5.3% |
| / | 33 | 1.4% |
| ? | 28 | 1.2% |
| * | 7 | 0.3% |
| ¡ | 3 | 0.1% |
| Other values (7) | 11 | 0.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 337 | |
| 3 | 175 | |
| 1 | 153 | |
| 0 | 127 | 12.0% |
| 4 | 73 | 6.9% |
| 9 | 49 | 4.6% |
| 5 | 44 | 4.1% |
| 7 | 39 | 3.7% |
| 8 | 33 | 3.1% |
| 6 | 31 | 2.9% |
Other Letter
| Value | Count | Frequency (%) |
| 爆 | 1 | |
| 撃 | 1 | |
| 衝 | 1 | |
| の | 1 | |
| 乳 | 1 | |
| 瞳 | 1 | |
| 中 | 1 | |
| 田 | 1 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 5 | |
| ³ | 2 | 20.0% |
| ⅓ | 1 | 10.0% |
| ² | 1 | 10.0% |
| ⁴ | 1 | 10.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 247 | |
| – | 8 | 3.1% |
| — | 3 | 1.2% |
Space Separator
| Value | Count | Frequency (%) |
| 17073 | ||
| 2 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 13 | |
| ] | 5 | 27.8% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 13 | |
| [ | 5 | 27.8% |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 2 | |
| $ | 1 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 3 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 3 |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ̀ | 2 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 1 |
Format
| Value | Count | Frequency (%) |
| | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 123062 | |
| Common | 20786 | 14.4% |
| Han | 7 | < 0.1% |
| Inherited | 2 | < 0.1% |
| Hiragana | 1 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 14842 | 12.1% |
| a | 8822 | 7.2% |
| o | 8648 | 7.0% |
| n | 7771 | 6.3% |
| r | 7587 | 6.2% |
| i | 7505 | 6.1% |
| t | 7162 | 5.8% |
| s | 5581 | 4.5% |
| h | 5379 | 4.4% |
| l | 4721 | 3.8% |
| Other values (60) | 45044 |
Common
| Value | Count | Frequency (%) |
| 17073 | ||
| : | 1182 | 5.7% |
| ' | 416 | 2.0% |
| 2 | 337 | 1.6% |
| - | 247 | 1.2% |
| . | 241 | 1.2% |
| 3 | 175 | 0.8% |
| 1 | 153 | 0.7% |
| , | 151 | 0.7% |
| ! | 138 | 0.7% |
| Other values (38) | 673 | 3.2% |
Han
| Value | Count | Frequency (%) |
| 爆 | 1 | |
| 撃 | 1 | |
| 衝 | 1 | |
| 乳 | 1 | |
| 瞳 | 1 | |
| 中 | 1 | |
| 田 | 1 |
Inherited
| Value | Count | Frequency (%) |
| ̀ | 2 |
Hiragana
| Value | Count | Frequency (%) |
| の | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 143739 | |
| None | 89 | 0.1% |
| Punctuation | 16 | < 0.1% |
| CJK | 7 | < 0.1% |
| Latin Ext Additional | 2 | < 0.1% |
| Diacriticals | 2 | < 0.1% |
| Modifier Letters | 1 | < 0.1% |
| Number Forms | 1 | < 0.1% |
| Hiragana | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 17073 | 11.9% | |
| e | 14842 | 10.3% |
| a | 8822 | 6.1% |
| o | 8648 | 6.0% |
| n | 7771 | 5.4% |
| r | 7587 | 5.3% |
| i | 7505 | 5.2% |
| t | 7162 | 5.0% |
| s | 5581 | 3.9% |
| h | 5379 | 3.7% |
| Other values (75) | 53369 |
None
| Value | Count | Frequency (%) |
| é | 43 | |
| ½ | 5 | 5.6% |
| á | 4 | 4.5% |
| í | 4 | 4.5% |
| è | 3 | 3.4% |
| ó | 3 | 3.4% |
| ¡ | 3 | 3.4% |
| à | 3 | 3.4% |
| ¿ | 2 | 2.2% |
| ¢ | 2 | 2.2% |
| Other values (16) | 17 | 19.1% |
Punctuation
| Value | Count | Frequency (%) |
| – | 8 | |
| — | 3 | 18.8% |
| ’ | 3 | 18.8% |
| 2 | 12.5% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ọ | 2 |
Diacriticals
| Value | Count | Frequency (%) |
| ̀ | 2 |
CJK
| Value | Count | Frequency (%) |
| 爆 | 1 | |
| 撃 | 1 | |
| 衝 | 1 | |
| 乳 | 1 | |
| 瞳 | 1 | |
| 中 | 1 | |
| 田 | 1 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 1 |
Number Forms
| Value | Count | Frequency (%) |
| ⅓ | 1 |
Hiragana
| Value | Count | Frequency (%) |
| の | 1 |
VoteAverage
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 43 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.5354654 |
| Minimum | 4.4 |
|---|---|
| Maximum | 8.6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 391.7 KiB |
Quantile statistics
| Minimum | 4.4 |
|---|---|
| 5-th percentile | 5.2 |
| Q1 | 6 |
| median | 6.6 |
| Q3 | 7.1 |
| 95-th percentile | 7.8 |
| Maximum | 8.6 |
| Range | 4.2 |
| Interquartile range (IQR) | 1.1 |
Descriptive statistics
| Standard deviation | 0.80047008 |
|---|---|
| Coefficient of variation (CV) | 0.12248096 |
| Kurtosis | -0.33411624 |
| Mean | 6.5354654 |
| Median Absolute Deviation (MAD) | 0.6 |
| Skewness | -0.15746386 |
| Sum | 55891.3 |
| Variance | 0.64075235 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.5 | 425 | 5.0% |
| 6.3 | 409 | 4.8% |
| 6.6 | 402 | 4.7% |
| 6.8 | 398 | 4.7% |
| 7 | 394 | 4.6% |
| 6.2 | 385 | 4.5% |
| 6.7 | 385 | 4.5% |
| 6.4 | 381 | 4.5% |
| 6.9 | 376 | 4.4% |
| 6.1 | 374 | 4.4% |
| Other values (33) | 4623 |
| Value | Count | Frequency (%) |
| 4.4 | 22 | 0.3% |
| 4.5 | 38 | 0.4% |
| 4.6 | 39 | 0.5% |
| 4.7 | 46 | 0.5% |
| 4.8 | 48 | 0.6% |
| 4.9 | 62 | |
| 5 | 79 | |
| 5.1 | 84 | |
| 5.2 | 121 | |
| 5.3 | 139 |
| Value | Count | Frequency (%) |
| 8.6 | 2 | < 0.1% |
| 8.5 | 10 | 0.1% |
| 8.4 | 26 | 0.3% |
| 8.3 | 39 | 0.5% |
| 8.2 | 55 | 0.6% |
| 8.1 | 61 | 0.7% |
| 8 | 87 | |
| 7.9 | 111 | |
| 7.8 | 129 | |
| 7.7 | 153 |
VoteCount
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 3261 |
|---|---|
| Distinct (%) | 38.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.2986884 |
| Minimum | 0 |
|---|---|
| Maximum | 10.452418 |
| Zeros | 27 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 391.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3.2580965 |
| Q1 | 5.370638 |
| median | 6.4281053 |
| Q3 | 7.4312997 |
| 95-th percentile | 8.7395204 |
| Maximum | 10.452418 |
| Range | 10.452418 |
| Interquartile range (IQR) | 2.0606616 |
Descriptive statistics
| Standard deviation | 1.6593754 |
|---|---|
| Coefficient of variation (CV) | 0.26344777 |
| Kurtosis | 0.95554062 |
| Mean | 6.2986884 |
| Median Absolute Deviation (MAD) | 1.0254422 |
| Skewness | -0.70407864 |
| Sum | 53866.383 |
| Variance | 2.7535267 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.6931471806 | 32 | 0.4% |
| 1.609437912 | 30 | 0.4% |
| 1.386294361 | 28 | 0.3% |
| 0 | 27 | 0.3% |
| 1.945910149 | 26 | 0.3% |
| 2.079441542 | 25 | 0.3% |
| 2.302585093 | 25 | 0.3% |
| 1.098612289 | 21 | 0.2% |
| 3.583518938 | 19 | 0.2% |
| 4.700480366 | 19 | 0.2% |
| Other values (3251) | 8300 |
| Value | Count | Frequency (%) |
| 0 | 27 | |
| 0.6931471806 | 32 | |
| 1.098612289 | 21 | |
| 1.386294361 | 28 | |
| 1.609437912 | 30 | |
| 1.791759469 | 17 | |
| 1.945910149 | 26 | |
| 2.079441542 | 25 | |
| 2.197224577 | 18 | |
| 2.302585093 | 25 |
| Value | Count | Frequency (%) |
| 10.45241788 | 1 | |
| 10.19320505 | 1 | |
| 10.08434971 | 1 | |
| 10.01721794 | 1 | |
| 9.980448594 | 1 | |
| 9.957833682 | 1 | |
| 9.957549511 | 1 | |
| 9.951896692 | 1 | |
| 9.951277216 | 1 | |
| 9.937888979 | 1 |
Budget
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 639 |
|---|---|
| Distinct (%) | 7.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.5715398 |
| Minimum | 0 |
|---|---|
| Maximum | 19.519293 |
| Zeros | 3604 |
| Zeros (%) | 42.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 391.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 15.045134 |
| Q3 | 17.034386 |
| 95-th percentile | 18.289713 |
| Maximum | 19.519293 |
| Range | 19.519293 |
| Interquartile range (IQR) | 17.034386 |
Descriptive statistics
| Standard deviation | 8.2632118 |
|---|---|
| Coefficient of variation (CV) | 0.8633106 |
| Kurtosis | -1.8715781 |
| Mean | 9.5715398 |
| Median Absolute Deviation (MAD) | 3.0878647 |
| Skewness | -0.26021247 |
| Sum | 81855.808 |
| Variance | 68.280669 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3604 | |
| 16.81124283 | 193 | 2.3% |
| 17.21670794 | 173 | 2.0% |
| 17.03438638 | 167 | 2.0% |
| 16.11809565 | 159 | 1.9% |
| 16.52356076 | 149 | 1.7% |
| 17.50439001 | 143 | 1.7% |
| 15.42494847 | 141 | 1.6% |
| 17.72753356 | 128 | 1.5% |
| 17.37085862 | 124 | 1.4% |
| Other values (629) | 3571 |
| Value | Count | Frequency (%) |
| 0 | 3604 | |
| 1.386294361 | 1 | < 0.1% |
| 1.609437912 | 1 | < 0.1% |
| 1.791759469 | 1 | < 0.1% |
| 1.945910149 | 1 | < 0.1% |
| 2.995732274 | 1 | < 0.1% |
| 3.258096538 | 1 | < 0.1% |
| 3.555348061 | 1 | < 0.1% |
| 4.418840608 | 1 | < 0.1% |
| 4.48863637 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 19.51929303 | 1 | < 0.1% |
| 19.33697148 | 6 | 0.1% |
| 19.31676877 | 2 | < 0.1% |
| 19.25358987 | 1 | < 0.1% |
| 19.23161096 | 3 | < 0.1% |
| 19.18614859 | 1 | < 0.1% |
| 19.15784481 | 1 | < 0.1% |
| 19.13852054 | 1 | < 0.1% |
| 19.11382792 | 32 | |
| 19.08851012 | 1 | < 0.1% |
TagLine
Text
MISSING 
| Distinct | 6728 |
|---|---|
| Distinct (%) | 99.4% |
| Missing | 1781 |
| Missing (%) | 20.8% |
| Memory size | 391.7 KiB |
Length
| Max length | 206 |
|---|---|
| Median length | 142 |
| Mean length | 40.110914 |
| Min length | 3 |
Characters and Unicode
| Total characters | 271591 |
|---|---|
| Distinct characters | 99 |
| Distinct categories | 14 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 4 ? |
Unique
| Unique | 6688 ? |
|---|---|
| Unique (%) | 98.8% |
Sample
| 1st row | Your mind is the scene of the crime. |
|---|---|
| 2nd row | Her world. Her secrets. Her legacy. |
| 3rd row | Welcome to the Real World. |
| 4th row | AI Combat Warrior Will be Unleashed. |
| 5th row | Happiest. Movie. Ever. |
| Value | Count | Frequency (%) |
| the | 3146 | 6.3% |
| a | 1814 | 3.6% |
| to | 1094 | 2.2% |
| of | 1018 | 2.0% |
| is | 1018 | 2.0% |
| you | 893 | 1.8% |
| in | 708 | 1.4% |
| and | 517 | 1.0% |
| for | 513 | 1.0% |
| one | 458 | 0.9% |
| Other values (5958) | 38869 |
Most occurring characters
| Value | Count | Frequency (%) |
| 43285 | ||
| e | 28672 | 10.6% |
| t | 16834 | 6.2% |
| o | 16785 | 6.2% |
| a | 14504 | 5.3% |
| n | 13712 | 5.0% |
| i | 13271 | 4.9% |
| r | 13223 | 4.9% |
| s | 12809 | 4.7% |
| h | 10845 | 4.0% |
| Other values (89) | 87651 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 199889 | |
| Space Separator | 43291 | 15.9% |
| Uppercase Letter | 14781 | 5.4% |
| Other Punctuation | 12453 | 4.6% |
| Decimal Number | 810 | 0.3% |
| Dash Punctuation | 237 | 0.1% |
| Final Punctuation | 96 | < 0.1% |
| Open Punctuation | 10 | < 0.1% |
| Close Punctuation | 10 | < 0.1% |
| Currency Symbol | 8 | < 0.1% |
| Other values (4) | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 28672 | |
| t | 16834 | 8.4% |
| o | 16785 | 8.4% |
| a | 14504 | 7.3% |
| n | 13712 | 6.9% |
| i | 13271 | 6.6% |
| r | 13223 | 6.6% |
| s | 12809 | 6.4% |
| h | 10845 | 5.4% |
| l | 8620 | 4.3% |
| Other values (24) | 50614 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 2378 | |
| A | 1310 | 8.9% |
| S | 1070 | 7.2% |
| W | 888 | 6.0% |
| H | 871 | 5.9% |
| I | 864 | 5.8% |
| B | 735 | 5.0% |
| N | 674 | 4.6% |
| F | 665 | 4.5% |
| E | 645 | 4.4% |
| Other values (16) | 4681 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 8344 | |
| ' | 1589 | 12.8% |
| , | 1115 | 9.0% |
| ! | 787 | 6.3% |
| ? | 385 | 3.1% |
| … | 89 | 0.7% |
| " | 55 | 0.4% |
| : | 34 | 0.3% |
| % | 16 | 0.1% |
| * | 13 | 0.1% |
| Other values (4) | 26 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 240 | |
| 1 | 171 | |
| 2 | 96 | 11.9% |
| 9 | 61 | 7.5% |
| 3 | 55 | 6.8% |
| 5 | 44 | 5.4% |
| 6 | 39 | 4.8% |
| 7 | 38 | 4.7% |
| 8 | 34 | 4.2% |
| 4 | 32 | 4.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 227 | |
| — | 6 | 2.5% |
| – | 4 | 1.7% |
Space Separator
| Value | Count | Frequency (%) |
| 43285 | ||
| 6 | < 0.1% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 94 | |
| ” | 2 | 2.1% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 2 | |
| ‘ | 1 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 10 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 10 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 8 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 1 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 1 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 214670 | |
| Common | 56921 | 21.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 28672 | |
| t | 16834 | 7.8% |
| o | 16785 | 7.8% |
| a | 14504 | 6.8% |
| n | 13712 | 6.4% |
| i | 13271 | 6.2% |
| r | 13223 | 6.2% |
| s | 12809 | 6.0% |
| h | 10845 | 5.1% |
| l | 8620 | 4.0% |
| Other values (50) | 65395 |
Common
| Value | Count | Frequency (%) |
| 43285 | ||
| . | 8344 | 14.7% |
| ' | 1589 | 2.8% |
| , | 1115 | 2.0% |
| ! | 787 | 1.4% |
| ? | 385 | 0.7% |
| 0 | 240 | 0.4% |
| - | 227 | 0.4% |
| 1 | 171 | 0.3% |
| 2 | 96 | 0.2% |
| Other values (29) | 682 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 271371 | |
| Punctuation | 198 | 0.1% |
| None | 21 | < 0.1% |
| Modifier Letters | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 43285 | ||
| e | 28672 | 10.6% |
| t | 16834 | 6.2% |
| o | 16785 | 6.2% |
| a | 14504 | 5.3% |
| n | 13712 | 5.1% |
| i | 13271 | 4.9% |
| r | 13223 | 4.9% |
| s | 12809 | 4.7% |
| h | 10845 | 4.0% |
| Other values (71) | 87431 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 94 | |
| … | 89 | |
| — | 6 | 3.0% |
| – | 4 | 2.0% |
| “ | 2 | 1.0% |
| ” | 2 | 1.0% |
| ‘ | 1 | 0.5% |
None
| Value | Count | Frequency (%) |
| 6 | ||
| é | 4 | |
| ñ | 2 | 9.5% |
| ü | 2 | 9.5% |
| á | 2 | 9.5% |
| ō | 1 | 4.8% |
| í | 1 | 4.8% |
| ½ | 1 | 4.8% |
| ù | 1 | 4.8% |
| ê | 1 | 4.8% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 1 |
RunTime
Real number (ℝ)
| Distinct | 97 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 103.63599 |
| Minimum | 55 |
|---|---|
| Maximum | 151 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 391.7 KiB |
Quantile statistics
| Minimum | 55 |
|---|---|
| 5-th percentile | 79 |
| Q1 | 92 |
| median | 102 |
| Q3 | 115 |
| 95-th percentile | 134 |
| Maximum | 151 |
| Range | 96 |
| Interquartile range (IQR) | 23 |
Descriptive statistics
| Standard deviation | 16.675884 |
|---|---|
| Coefficient of variation (CV) | 0.16090823 |
| Kurtosis | -0.051048792 |
| Mean | 103.63599 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 0.32281219 |
| Sum | 886295 |
| Variance | 278.08512 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 95 | 269 | 3.1% |
| 90 | 262 | 3.1% |
| 100 | 246 | 2.9% |
| 93 | 240 | 2.8% |
| 105 | 230 | 2.7% |
| 97 | 229 | 2.7% |
| 98 | 226 | 2.6% |
| 94 | 217 | 2.5% |
| 101 | 214 | 2.5% |
| 96 | 212 | 2.5% |
| Other values (87) | 6207 |
| Value | Count | Frequency (%) |
| 55 | 3 | < 0.1% |
| 56 | 4 | < 0.1% |
| 57 | 3 | < 0.1% |
| 58 | 4 | < 0.1% |
| 59 | 7 | |
| 60 | 15 | |
| 61 | 8 | |
| 62 | 6 | 0.1% |
| 63 | 7 | |
| 64 | 10 |
| Value | Count | Frequency (%) |
| 151 | 12 | |
| 150 | 12 | |
| 149 | 13 | |
| 148 | 10 | |
| 147 | 18 | |
| 146 | 15 | |
| 145 | 21 | |
| 144 | 13 | |
| 143 | 24 | |
| 142 | 14 |
Revenue
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 5092 |
|---|---|
| Distinct (%) | 59.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.555716 |
| Minimum | 0 |
|---|---|
| Maximum | 21.449956 |
| Zeros | 3247 |
| Zeros (%) | 38.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 391.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 15.57572 |
| Q3 | 17.822244 |
| 95-th percentile | 19.401506 |
| Maximum | 21.449956 |
| Range | 21.449956 |
| Interquartile range (IQR) | 17.822244 |
Descriptive statistics
| Standard deviation | 8.4310991 |
|---|---|
| Coefficient of variation (CV) | 0.79872354 |
| Kurtosis | -1.7385716 |
| Mean | 10.555716 |
| Median Absolute Deviation (MAD) | 3.3579764 |
| Skewness | -0.38812462 |
| Sum | 90272.486 |
| Variance | 71.083432 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3247 | |
| 16.21340583 | 11 | 0.1% |
| 14.50865774 | 10 | 0.1% |
| 16.30041721 | 10 | 0.1% |
| 17.03438638 | 8 | 0.1% |
| 16.11809565 | 8 | 0.1% |
| 15.76142071 | 8 | 0.1% |
| 17.21670794 | 8 | 0.1% |
| 15.42494847 | 7 | 0.1% |
| 15.8949521 | 6 | 0.1% |
| Other values (5082) | 5229 |
| Value | Count | Frequency (%) |
| 0 | 3247 | |
| 1.098612289 | 1 | < 0.1% |
| 1.945910149 | 1 | < 0.1% |
| 2.302585093 | 1 | < 0.1% |
| 3.36729583 | 1 | < 0.1% |
| 3.761200116 | 1 | < 0.1% |
| 4.543294782 | 1 | < 0.1% |
| 4.836281907 | 1 | < 0.1% |
| 5.303304908 | 1 | < 0.1% |
| 5.733341277 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 21.44995592 | 1 | |
| 21.23700967 | 1 | |
| 21.1389066 | 1 | |
| 21.02331567 | 1 | |
| 20.99364886 | 1 | |
| 20.95921976 | 1 | |
| 20.94063705 | 1 | |
| 20.93515034 | 1 | |
| 20.91848487 | 1 | |
| 20.86886373 | 1 |
Genres
Text
| Distinct | 2086 |
|---|---|
| Distinct (%) | 24.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 391.7 KiB |
Length
| Max length | 84 |
|---|---|
| Median length | 60 |
| Mean length | 21.514032 |
| Min length | 0 |
Characters and Unicode
| Total characters | 183988 |
|---|---|
| Distinct characters | 28 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1309 ? |
|---|---|
| Unique (%) | 15.3% |
Sample
| 1st row | Action, ScienceFiction, Adventure |
|---|---|
| 2nd row | Action, Adventure, ScienceFiction |
| 3rd row | Action, ScienceFiction |
| 4th row | ScienceFiction |
| 5th row | Family, Animation, Comedy, Fantasy, Adventure, Music |
| Value | Count | Frequency (%) |
| drama | 3327 | |
| comedy | 2665 | |
| thriller | 2403 | |
| action | 2361 | |
| adventure | 1589 | 7.0% |
| romance | 1385 | 6.1% |
| horror | 1363 | 6.0% |
| crime | 1198 | 5.3% |
| fantasy | 1121 | 4.9% |
| family | 1118 | 4.9% |
| Other values (10) | 4209 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 16620 | 9.0% |
| 14370 | 7.8% | |
| e | 14302 | 7.8% |
| , | 14040 | 7.6% |
| i | 12988 | 7.1% |
| a | 12745 | 6.9% |
| o | 11803 | 6.4% |
| n | 10836 | 5.9% |
| m | 10798 | 5.9% |
| t | 8514 | 4.6% |
| Other values (18) | 56972 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 131763 | |
| Uppercase Letter | 23815 | 12.9% |
| Space Separator | 14370 | 7.8% |
| Other Punctuation | 14040 | 7.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 16620 | |
| e | 14302 | |
| i | 12988 | |
| a | 12745 | |
| o | 11803 | |
| n | 10836 | |
| m | 10798 | |
| t | 8514 | |
| c | 7227 | 5.5% |
| y | 6940 | 5.3% |
| Other values (6) | 18990 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 4961 | |
| C | 3863 | |
| D | 3421 | |
| F | 3301 | |
| T | 2572 | |
| H | 1693 | 7.1% |
| R | 1385 | 5.8% |
| M | 1176 | 4.9% |
| S | 1062 | 4.5% |
| W | 381 | 1.6% |
Space Separator
| Value | Count | Frequency (%) |
| 14370 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 14040 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 155578 | |
| Common | 28410 | 15.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 16620 | 10.7% |
| e | 14302 | 9.2% |
| i | 12988 | 8.3% |
| a | 12745 | 8.2% |
| o | 11803 | 7.6% |
| n | 10836 | 7.0% |
| m | 10798 | 6.9% |
| t | 8514 | 5.5% |
| c | 7227 | 4.6% |
| y | 6940 | 4.5% |
| Other values (16) | 42805 |
Common
| Value | Count | Frequency (%) |
| 14370 | ||
| , | 14040 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 183988 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 16620 | 9.0% |
| 14370 | 7.8% | |
| e | 14302 | 7.8% |
| , | 14040 | 7.6% |
| i | 12988 | 7.1% |
| a | 12745 | 6.9% |
| o | 11803 | 6.4% |
| n | 10836 | 5.9% |
| m | 10798 | 5.9% |
| t | 8514 | 4.6% |
| Other values (18) | 56972 |
North America
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 391.7 KiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8552 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 6152 | |
| 0 | 2400 | 28.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 6152 | |
| 0 | 2400 | 28.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 6152 | |
| 0 | 2400 | 28.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8552 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 6152 | |
| 0 | 2400 | 28.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8552 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 6152 | |
| 0 | 2400 | 28.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8552 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 6152 | |
| 0 | 2400 | 28.1% |
Europe
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 391.7 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8552 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 6137 | |
| 1 | 2415 | 28.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 6137 | |
| 1 | 2415 | 28.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 6137 | |
| 1 | 2415 | 28.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8552 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 6137 | |
| 1 | 2415 | 28.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8552 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 6137 | |
| 1 | 2415 | 28.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8552 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 6137 | |
| 1 | 2415 | 28.2% |
Asia
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 391.7 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8552 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 7132 | |
| 1 | 1420 | 16.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 7132 | |
| 1 | 1420 | 16.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 7132 | |
| 1 | 1420 | 16.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8552 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 7132 | |
| 1 | 1420 | 16.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8552 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 7132 | |
| 1 | 1420 | 16.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8552 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 7132 | |
| 1 | 1420 | 16.6% |
Oceania
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 391.7 KiB |
| 0 | |
|---|---|
| 1 | 202 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8552 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 8350 | |
| 1 | 202 | 2.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 8350 | |
| 1 | 202 | 2.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8350 | |
| 1 | 202 | 2.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8552 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8350 | |
| 1 | 202 | 2.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8552 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8350 | |
| 1 | 202 | 2.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8552 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8350 | |
| 1 | 202 | 2.4% |
South America
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 391.7 KiB |
| 0 | |
|---|---|
| 1 | 104 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8552 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 8448 | |
| 1 | 104 | 1.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 8448 | |
| 1 | 104 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8448 | |
| 1 | 104 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8552 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8448 | |
| 1 | 104 | 1.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8552 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8448 | |
| 1 | 104 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8552 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8448 | |
| 1 | 104 | 1.2% |
Africa
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 391.7 KiB |
| 0 | |
|---|---|
| 1 | 67 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8552 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 8485 | |
| 1 | 67 | 0.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 8485 | |
| 1 | 67 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8485 | |
| 1 | 67 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8552 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8485 | |
| 1 | 67 | 0.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8552 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8485 | |
| 1 | 67 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8552 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8485 | |
| 1 | 67 | 0.8% |
Year
Real number (ℝ)
| Distinct | 100 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2006.3337 |
| Minimum | 1920 |
|---|---|
| Maximum | 2023 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 358.3 KiB |
Quantile statistics
| Minimum | 1920 |
|---|---|
| 5-th percentile | 1974 |
| Q1 | 1999 |
| median | 2011 |
| Q3 | 2018 |
| 95-th percentile | 2022 |
| Maximum | 2023 |
| Range | 103 |
| Interquartile range (IQR) | 19 |
Descriptive statistics
| Standard deviation | 16.026195 |
|---|---|
| Coefficient of variation (CV) | 0.0079878011 |
| Kurtosis | 2.7768258 |
| Mean | 2006.3337 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | -1.5426722 |
| Sum | 17158166 |
| Variance | 256.83891 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2022 | 548 | 6.4% |
| 2018 | 408 | 4.8% |
| 2023 | 395 | 4.6% |
| 2019 | 393 | 4.6% |
| 2021 | 391 | 4.6% |
| 2017 | 358 | 4.2% |
| 2020 | 352 | 4.1% |
| 2016 | 319 | 3.7% |
| 2015 | 280 | 3.3% |
| 2014 | 278 | 3.3% |
| Other values (90) | 4830 |
| Value | Count | Frequency (%) |
| 1920 | 1 | < 0.1% |
| 1921 | 1 | < 0.1% |
| 1922 | 1 | < 0.1% |
| 1925 | 2 | < 0.1% |
| 1927 | 2 | < 0.1% |
| 1928 | 1 | < 0.1% |
| 1930 | 1 | < 0.1% |
| 1931 | 5 | |
| 1932 | 3 | |
| 1933 | 3 |
| Value | Count | Frequency (%) |
| 2023 | 395 | |
| 2022 | 548 | |
| 2021 | 391 | |
| 2020 | 352 | |
| 2019 | 393 | |
| 2018 | 408 | |
| 2017 | 358 | |
| 2016 | 319 | |
| 2015 | 280 | |
| 2014 | 278 |
weighted_average
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 7849 |
|---|---|
| Distinct (%) | 91.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.5498904 |
| Minimum | 5.4038125 |
|---|---|
| Maximum | 7.6334925 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 391.7 KiB |
Quantile statistics
| Minimum | 5.4038125 |
|---|---|
| 5-th percentile | 5.974818 |
| Q1 | 6.3132446 |
| median | 6.5354654 |
| Q3 | 6.7889355 |
| 95-th percentile | 7.1370983 |
| Maximum | 7.6334925 |
| Range | 2.22968 |
| Interquartile range (IQR) | 0.47569093 |
Descriptive statistics
| Standard deviation | 0.35066194 |
|---|---|
| Coefficient of variation (CV) | 0.05353707 |
| Kurtosis | -0.20146505 |
| Mean | 6.5498904 |
| Median Absolute Deviation (MAD) | 0.23648723 |
| Skewness | 0.048720438 |
| Sum | 56014.662 |
| Variance | 0.12296379 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.535465388 | 27 | 0.3% |
| 6.491058874 | 7 | 0.1% |
| 6.342978493 | 7 | 0.1% |
| 6.300293913 | 4 | < 0.1% |
| 6.429759212 | 4 | < 0.1% |
| 6.345054157 | 4 | < 0.1% |
| 6.269008771 | 4 | < 0.1% |
| 6.55810872 | 4 | < 0.1% |
| 6.615454909 | 4 | < 0.1% |
| 6.408128184 | 4 | < 0.1% |
| Other values (7839) | 8483 |
| Value | Count | Frequency (%) |
| 5.403812462 | 1 | |
| 5.509902804 | 1 | |
| 5.512348571 | 1 | |
| 5.513940925 | 1 | |
| 5.544676306 | 1 | |
| 5.549962091 | 1 | |
| 5.558572425 | 1 | |
| 5.562033603 | 1 | |
| 5.563070173 | 1 | |
| 5.564724316 | 1 |
| Value | Count | Frequency (%) |
| 7.63349248 | 1 | |
| 7.618428975 | 1 | |
| 7.616375776 | 1 | |
| 7.611164562 | 1 | |
| 7.593800538 | 1 | |
| 7.585348039 | 1 | |
| 7.575562315 | 1 | |
| 7.569528587 | 1 | |
| 7.556874884 | 1 | |
| 7.55625751 | 1 |
| Popularity | VoteAverage | VoteCount | Budget | RunTime | Revenue | Year | weighted_average | OriginalLanguage | North America | Europe | Asia | Oceania | South America | Africa | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Popularity | 1.000 | 0.128 | 0.386 | 0.249 | 0.065 | 0.293 | 0.138 | 0.134 | 0.079 | 0.099 | 0.047 | 0.035 | 0.012 | 0.000 | 0.000 |
| VoteAverage | 0.128 | 1.000 | 0.238 | -0.023 | 0.286 | 0.085 | -0.053 | 0.989 | 0.192 | 0.134 | 0.027 | 0.140 | 0.008 | 0.021 | 0.028 |
| VoteCount | 0.386 | 0.238 | 1.000 | 0.671 | 0.290 | 0.721 | -0.246 | 0.237 | 0.428 | 0.424 | 0.053 | 0.293 | 0.033 | 0.077 | 0.032 |
| Budget | 0.249 | -0.023 | 0.671 | 1.000 | 0.311 | 0.781 | -0.249 | -0.029 | 0.371 | 0.383 | 0.022 | 0.211 | 0.000 | 0.058 | 0.000 |
| RunTime | 0.065 | 0.286 | 0.290 | 0.311 | 1.000 | 0.324 | -0.055 | 0.288 | 0.098 | 0.075 | 0.119 | 0.146 | 0.000 | 0.000 | 0.020 |
| Revenue | 0.293 | 0.085 | 0.721 | 0.781 | 0.324 | 1.000 | -0.300 | 0.080 | 0.320 | 0.336 | 0.122 | 0.145 | 0.063 | 0.054 | 0.000 |
| Year | 0.138 | -0.053 | -0.246 | -0.249 | -0.055 | -0.300 | 1.000 | -0.058 | 0.157 | 0.191 | 0.137 | 0.110 | 0.033 | 0.052 | 0.023 |
| weighted_average | 0.134 | 0.989 | 0.237 | -0.029 | 0.288 | 0.080 | -0.058 | 1.000 | 0.178 | 0.136 | 0.032 | 0.134 | 0.008 | 0.017 | 0.009 |
| OriginalLanguage | 0.079 | 0.192 | 0.428 | 0.371 | 0.098 | 0.320 | 0.157 | 0.178 | 1.000 | 0.787 | 0.146 | 0.569 | 0.072 | 0.123 | 0.030 |
| North America | 0.099 | 0.134 | 0.424 | 0.383 | 0.075 | 0.336 | 0.191 | 0.136 | 0.787 | 1.000 | 0.291 | 0.487 | 0.017 | 0.086 | 0.033 |
| Europe | 0.047 | 0.027 | 0.053 | 0.022 | 0.119 | 0.122 | 0.137 | 0.032 | 0.146 | 0.291 | 1.000 | 0.177 | 0.000 | 0.000 | 0.026 |
| Asia | 0.035 | 0.140 | 0.293 | 0.211 | 0.146 | 0.145 | 0.110 | 0.134 | 0.569 | 0.487 | 0.177 | 1.000 | 0.022 | 0.026 | 0.017 |
| Oceania | 0.012 | 0.008 | 0.033 | 0.000 | 0.000 | 0.063 | 0.033 | 0.008 | 0.072 | 0.017 | 0.000 | 0.022 | 1.000 | 0.000 | 0.013 |
| South America | 0.000 | 0.021 | 0.077 | 0.058 | 0.000 | 0.054 | 0.052 | 0.017 | 0.123 | 0.086 | 0.000 | 0.026 | 0.000 | 1.000 | 0.000 |
| Africa\r\r | 0.000 | 0.028 | 0.032 | 0.000 | 0.020 | 0.000 | 0.023 | 0.009 | 0.030 | 0.033 | 0.026 | 0.017 | 0.013 | 0.000 | 1.000 |
| OriginalLanguage | OriginalTitle | Overview | Popularity | ReleaseDate | Title | VoteAverage | VoteCount | Budget | TagLine | RunTime | Revenue | Genres | North America | Europe | Asia | Oceania | South America | Africa | Year | weighted_average | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Id | |||||||||||||||||||||
| 27205 | 1 | Inception | skilled thief corporate espionage subconscious target chance regain old life payment task considered impossible inception implantation another person idea target subconscious | 4.401216 | 2010-07-15 | Inception | 8.4 | 10.452418 | 18.890684 | Your mind is the scene of the crime. | 148 | 20.531540 | Action, ScienceFiction, Adventure | 1 | 1 | 0 | 0 | 0 | 0 | 2010 | 7.611165 |
| 497698 | 1 | Black Widow | also known black widow part ledger dangerous conspiracy tie past force stop nothing bring must deal history spy broken relationship left wake long avenger | 4.399596 | 2021-07-07 | Black Widow | 7.3 | 9.142918 | 19.113828 | Her world. Her secrets. Her legacy. | 134 | 19.755027 | Action, Adventure, ScienceFiction | 1 | 0 | 0 | 0 | 0 | 0 | 2021 | 6.951345 |
| 603 | 1 | The Matrix | century matrix tell story computer hacker join group underground insurgent fighting vast powerful computer rule earth | 4.393980 | 1999-03-30 | The Matrix | 8.2 | 10.084350 | 17.958645 | Welcome to the Real World. | 136 | 19.954354 | Action, ScienceFiction | 1 | 0 | 0 | 0 | 0 | 0 | 1999 | 7.481176 |
| 843794 | 0 | 정이 | uninhabitable earth outcome civil war hinge brain elite soldier create robot mercenary | 4.393819 | 2023-01-12 | JUNG_E | 6.2 | 6.238325 | 0.000000 | AI Combat Warrior Will be Unleashed. | 98 | 0.000000 | ScienceFiction | 0 | 0 | 1 | 0 | 0 | 0 | 2023 | 6.384944 |
| 446893 | 1 | Trolls World Tour | queen poppy branch make surprising discovery — troll world beyond distinct difference create big clash various tribe mysterious threat put troll across land danger poppy branch band friend must embark epic quest create harmony among troll unite certain doom | 4.393436 | 2020-03-11 | Trolls World Tour | 7.3 | 7.567346 | 18.315320 | Happiest. Movie. Ever. | 90 | 17.712964 | Family, Animation, Comedy, Fantasy, Adventure, Music | 1 | 0 | 0 | 0 | 0 | 0 | 2020 | 6.915282 |
| 729854 | 0 | 콘크리트 유토피아 | world reduced rubble massive earthquake one know sure far ruin stretch because earthquake may heart one apartment building left standing apartment | 4.390862 | 2023-08-09 | Concrete Utopia | 7.6 | 2.708050 | 0.000000 | We believe we are chosen | 130 | 0.000000 | Thriller, ScienceFiction, Drama | 0 | 0 | 1 | 0 | 0 | 0 | 2023 | 6.813379 |
| 24021 | 1 | The Twilight Saga: Eclipse | find surrounded danger string mysterious killing malicious vampire quest revenge midst forced choose love friendship knowing decision potential ignite ageless struggle vampire werewolf graduation quickly approaching important decision life | 4.385907 | 2010-06-23 | The Twilight Saga: Eclipse | 6.2 | 9.018938 | 18.035018 | It all begins... with a choice. | 124 | 20.364433 | Adventure, Fantasy, Drama, Romance | 1 | 0 | 0 | 0 | 0 | 0 | 2010 | 6.354121 |
| 1880 | 1 | Red Dawn | dawn world war group band together defend town — and country — from soviet force | 4.384972 | 1984-08-10 | Red Dawn | 6.3 | 6.552508 | 16.648724 | In our time, no foreign army has ever occupied American soil. Until now. | 114 | 17.462956 | Action, Thriller, War, Drama | 1 | 0 | 0 | 0 | 0 | 0 | 1984 | 6.426945 |
| 4257 | 1 | Scary Movie 4 | find house life little boy go quest find also alien tripod world uncover secret order stop | 4.384935 | 2006-04-12 | Scary Movie 4 | 5.5 | 8.007700 | 17.622173 | Bury the grudge. Burn the village. See the saw. | 83 | 18.998768 | Comedy | 1 | 0 | 0 | 0 | 0 | 0 | 2006 | 6.006412 |
| 28676 | 0 | Schiave bianche: violenza in Amazzonia | young woman seek vengeance find love parent taken prisoner indigenous tribe | 4.383575 | 1985-08-09 | Amazonia: The Catherine Miles Story | 6.1 | 5.036953 | 0.000000 | Only one thing kept her alive. | 90 | 0.000000 | Adventure, Drama, Horror | 0 | 1 | 0 | 0 | 0 | 0 | 1985 | 6.362782 |
| OriginalLanguage | OriginalTitle | Overview | Popularity | ReleaseDate | Title | VoteAverage | VoteCount | Budget | TagLine | RunTime | Revenue | Genres | North America | Europe | Asia | Oceania | South America | Africa | Year | weighted_average | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Id | |||||||||||||||||||||
| 11092 | 1 | Presumed Innocent | rusty deputy prosecutor engaged obsessive affair soon he accused crime fight clear name becomes whirlpool lie hidden passion | 2.569095 | 1990-07-27 | Presumed Innocent | 6.8 | 6.388561 | 16.906553 | Some people would kill for love | 127 | 19.215044 | Mystery, Crime, Thriller | 1 | 0 | 0 | 0 | 0 | 0 | 1990 | 6.655719 |
| 664423 | 1 | The Windermere Children | story project rehabilitate child survivor holocaust shore lake | 2.568941 | 2020-01-27 | The Windermere Children | 7.5 | 4.564348 | 0.000000 | NaN | 88 | 0.000000 | Drama, TvMovie, History | 0 | 1 | 0 | 0 | 0 | 0 | 2020 | 6.895458 |
| 3077 | 1 | Son of Frankenstein | one son late henry find father ghoulish creation coma find monster bent revenge | 2.568941 | 1939-01-13 | Son of Frankenstein | 6.7 | 5.323010 | 12.948010 | The black shadows of the past bred this half-man . . . half-demon ! . . . creating a new and terrible juggernaut of destruction ! | 99 | 0.000000 | Horror, ScienceFiction | 1 | 0 | 0 | 0 | 0 | 0 | 1939 | 6.602898 |
| 413543 | 0 | Dear Zindagi | unconventional thinker help budding cinematographer gain new perspective life | 2.568865 | 2016-11-23 | Dear Zindagi | 7.1 | 5.347108 | 15.274126 | NaN | 151 | 15.032313 | Drama, Romance | 0 | 0 | 1 | 0 | 0 | 0 | 2016 | 6.767451 |
| 14400 | 0 | Largo Winch | powerful billionaire secret adoptive son must race prove legitimacy find father killer stop taking financial empire | 2.568865 | 2008-12-17 | The Heir Apparent: Largo Winch | 6.0 | 6.186209 | 17.050762 | NaN | 108 | 0.000000 | Adventure, Drama, Action, Thriller | 0 | 1 | 1 | 0 | 0 | 0 | 2008 | 6.296317 |
| 2749 | 1 | 15 Minutes | eastern criminal come new york city pick share score steal video camera start activity legal illegal learn medium circus make remorseless killer look like victim make rich target homicide detective fire marshal warsaw cop investigating murder former criminal partner everything sell local tabloid show top story | 2.568865 | 2001-03-01 | 15 Minutes | 5.9 | 6.470800 | 17.909855 | America Likes to Watch | 120 | 17.847270 | Action, Crime, Thriller | 1 | 1 | 0 | 0 | 0 | 0 | 2001 | 6.244575 |
| 11128 | 1 | Ladder 49 | watchful eye mentor captain mike probationary jack seasoned veteran fire station however jack crossroad sacrifice he made put harm way innumerable time significantly impacted relationship wife | 2.568788 | 2004-10-01 | Ladder 49 | 6.4 | 6.561031 | 17.909855 | Their greatest challenge lies in rescuing one of their own | 115 | 18.126869 | Drama, Action, Thriller | 1 | 0 | 0 | 0 | 0 | 0 | 2004 | 6.472989 |
| 484482 | 0 | Le Grand Bain | suffering depression last two year barely able keep head water despite medication gulp day every day wife encouragement unable find meaning life curiously end finding sense purpose swimming pool joining swimming team | 2.568712 | 2018-10-24 | Sink or Swim | 6.9 | 7.242798 | 0.000000 | NaN | 122 | 0.000000 | Drama, Comedy | 0 | 1 | 0 | 0 | 0 | 0 | 2018 | 6.712571 |
| 453755 | 1 | Arctic | man arctic finally receive long rescue however tragic accident opportunity lost must decide whether remain relative safety camp embark deadly trek unknown potential salvation | 2.568712 | 2018-11-21 | Arctic | 6.5 | 6.998510 | 14.508658 | Survival is the only option | 98 | 15.226498 | Drama | 1 | 1 | 0 | 0 | 0 | 0 | 2018 | 6.518539 |
| 54518 | 1 | Justin Bieber: Never Say Never | tell story canada hair smile voice chronicle unprecedented rise fame way street canada video selling square garden new york headline act world tour feature usher scooter la men cyrus smith family member part crew huge mix interview guest performance theater around world highest concert movie time beating previous record | 2.568712 | 2011-02-11 | Justin Bieber: Never Say Never | 5.2 | 5.934894 | 16.380460 | Find out what's possible if you never give up. | 105 | 18.405567 | Music, Documentary, Family | 1 | 0 | 0 | 0 | 0 | 0 | 2011 | 5.952678 |